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SECTION I 



INTRODUCTION 

The MITRE Corporation Policy and Procedures manual (L.l) states, 
"It is MITRE policy to provide a facility and an organization to 
acquire, control, store and retrieve information in the form of books, 
pamphlets, periodicals and reports, needed by MITRE in the prosecution 
of its mission." In implementing this policy the Washington Library 
» recently instituted a Key-Word-Out-of-Context (KWOC) index for the 

technical reports collection. This paper provides some background 
information about the KWOC system; guides to the Library Staff, 
Technical Staff and Administrators on how to use the system; and pro- 
grammer notes including flow charts and program listings for program 
maintenance . 

What is a KWOC Index ? 

A KWOC index is a special form of index in which: (1) an entry 

is created for every substantive word in the report title, and (2) 
entries show the substantive word (keyword) and the portion of the 
context out of which the word was obtained. A sample of a KWOC index 
is shown in Figure 1. 

A KWOC index is a special form of a Key-Word-In-Context (KWIC) 
index, Both KWOC and KWIC are based on the selection of keywords 
. directly from the title alone or from a more complete bibliographic 

citation, 

A more complete discussion of the theoretical background and 
the underlying structure of both KWOC and KWIC indexing is presented 
in Appendix I. 
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Each index entry in The Washington Library KWOC index is 
generated from a technical report citation. The manner in which it 
is generated is shown in Figure 2. After such entries are generated 
for a collection of reports they are sorted and the KWOC output 
listing is generated. In this listing one line is printed for each 
index entry. The line contains the keyword on the left, and in the 
wider field on the right, the context out of which the keyword was 
obtained. 

Is This A Standard KWOC Index ? 

No. The Washington Library has adapted the basic KWOC technique 
to the special circumstances that pertain there. Three adaptations 
have been made. Each is aimed at further improving the usefulness 
of the index in The Washington Operations environment. The adaptations 
are: 

(1) multi-card citation input, 

(2) input tagging, an' 1 

(3) input enrichment, 

These adaptations, along with some of the rationale behind them, are 
discussed below, 
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GENERATING A KWOC INDEX 
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Multi-Card Citation 



A standard KWOC index uses a single card input per report. This 
normally allows the opportunity to input the report accession number 
and about 60 characters of the title. Over half of the technical 
reports received by The Washington Library have titles significantly 
longer than 60 characters. In order to assure the possibility of 
obtaining the full title on input, a multi-card citation input process 
was developed. Details of the process of preparing the input are 
presented in Section II under KWOC Input Processing. 

In addition to assuring input of the full report title, the 
multi-card citation input readily permits the addition of any oth^r 
desired information for indexing purposes. During development oi the 
HfOC index various elements in Washington Operations desired to in- 
clude 8 other reference entries: 

*• Author 

- Corporate Source 

- Corporate Report Number 

• Date 

- Other Identifying Number 

- Department Number 

- Contract Number 

- Task Number 

With the number of input cards substantially unlimited it is possible 
to include all of these and others if this is desired fn the future. 
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Input Tagging 



Tagging is a feature of the Washington IWOC index which was 
developed at MITRE to overcome a basic KWOC format problem. Users c 
both KWIC and KWOC indexes have found it annoying to scan down the 1 
of keywords and find authors' names, department identifiers and ori- 
ginators' document identifications intermixed in thr listing sequenc 
with keywords from titles. This problem is further complicated, in 
the case of The Washington Library, by a requirement to include in tl 
index the many other items of bibliographic information, along with 
titles. Many of these are alphanumuric and would, normally, sort in 
sequence with title keywords in the output listing. 

The solution developed for this problem begins with assigning 
special character tags to be prefixed, on input, to each of the non- 
title items desired in the index. See Table 1. These tags are carrl 
by the system through the keyword sorting process. Since the tags 
are in the high-order (left-most) position for the index sort, they 
force like items together. After sorting, the print program deletes 
the tags fiom the left margin field but leaves the sorted records in 
the sequence in which they were put before the tags were deleted, 

The resulting output is, thus, divided into 10 major segments 
corresponding to the 10 types of tags and the 10 items of interest. 
Title words appear in the subject index section; authors' names all 
appear together and in alphabetic sequence; dates appear together 
and in sequence from earliest to most recent and are not intermixed 
with project numbers, and so on. 
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TABLE I 

ITEM DESIRED FOR CITATION 



ITEM 

Title 

Author(s) 

Corporate Source 
Corporate Report Number 
Date 

AD, PB or NASA Number 

Department Number 

Contract Number 

Project or Task Number 

Accession Number 

Internal documents 
(3 different series) 

External 



EXAMPLE 

Growth of Urban.,. 
Jones, J. P. 

RAND 
P-1234 
680929 
AD 123 456 
D-31 

F19(628)-63-C-0365 

31.5.0 



WP1234 
MTR 456 

M67 

MW 2468 



TAGGED FORM 
Growth of Urban 
-JONES, J. P. 

*RAND 

:P-1234 

/680929 

=AD123456 

.D-31 

+F19(628)-68-C-036 

%31.5.0 

•WP1234 

•MTR456 

•M67 

•MW2468 



NOTES: The black dot • is used in the above illustration as a visible 

substitute for a numerical quantity, hexadecimal 00, for which no 
printable 360 character has been assigned. This quantity is auto- 
matically prefixed to the accession number by the KWOC program. 
When the accession numbers are printed, the unprintable character 
does not print and a blank appears in its place. 
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One bonus of this approach is the ability to reconstitute the 
multiple input card images in their original sequence, at the be- 
ginning of the listing. The basic conditions to permit this are set 
when the accession numbers, treated as keywords, are pulled together 
by their tags and sequenced. Since every citation normally requires 
two to three cards of input, a convention was established to enter 
the accession number first on each card and to suffix card sequence 
letters to the accession number on each card as required. The result, 
after keyword sorting, is the complete citation as originally entered. 

The basic citation section of the listing is bound separately 
from the others and used side-by-side during searches of the other 
portions of the index. This separate list is especially useful when 
the context from which the keyword was obtained (the content of the 
card on which it appeared) happens not to include the other items with 
which the user is trying to coordinate the term he is accessing. See 
Section III for more details on the use of the basic citation listing. 
Input Enrichment 

Another standard objection to conventional KWOC indexing centers 
around the problem of non-informative titles, titles which do not 
contain good, descriptive keywords. As a hedge against lost informa- 
tion, it was decided for this application to enrich the basic KWOC 
citation with human indexing. Enrichment, in this application, is 
defined as the adding of extra index terms which do not appear in the 
title or with the other standard bibliographic data in the document. 

In the input processing stage of the system, all of the basic 
citation information is obtained from the document by a library aide. 




8 



The professional indexer scans this for accuracy and also makes a 
determination as to whether or not the title is properly descriptive. 

If enrichment is deemed desirable, the indexer adds extra terms to 
the input record and suffixes them with an asterisk, The indexer 
is also encouraged, when time permits, to highlight particularly 
descriptive terms in the title. These are also indicated with the 
suffixed asterisk. 

Additional Background Informa tion 

The KWOC index is currently being used with a collection of 
over 10,000 technical reports growing at a rate of r\ proximately 
6,800 per year. Report subject matter is varied representing the 
combined interests of 370 technical staff members working on diverse 
military and civilian systems engineering problems. The rest are 
internally generated. 01: the former, about two-thirds are obtained 
from the Defense Documentation Center (DDC) or the National Technical 
Information Service (formerly the Clearinghouse-CFSTI) . The others 
are obtained directly from the originating organizations. 

The system is based on use of three programs run on the IBM 
360/50 in a multi-programming mode under OS 360, Version 18 with 
HASP (Houston Automatic Spooling Processor, an I/O expediting feature). 
The programs are: KW0C1, which generates the basic index record; the 

Sort-Merge utility program of OS 360, and KWOC 2 which prinus the out- 
put. These three programs are used to prepare and check the newly in- 
dexed material weekly. After checking, newly indexed records are 
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accumulated and, periodically, inserted into the master file with 
the Sort-Merge utility under Job Control Language (JCL) . The mascer 
file is currently printed in its entirety at the same time. 

Weekly cumulative index, sort and print runs for up to 800 
documents require less than 5 minutes of computer time. Master 
file merge and print runs, printing the complete index, have not 
yet exceeded 12 minutes of charged computer time. 
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SECTION II 



PROCESSING IN THE LIBRARY 
How Incoming Reports are Processed 

The flow of technical reports through the Library accessions and 
indexing processes is shown in Figure 3. 

Accessions Process 

Almost all technical reports in the library are obtained at the 
request of members of the MITRE Technical Staff. When requesting a 
report the staff member completes form MCF 1825, 9/70 Figure 4. From 
this form the library takes the bibliographic information, and, if 
the report is not already in the collection, submits an order to the 
appropriate supplier. Request forms are filed in a pending request 
file (ringbinder) until fulfilled by receipt of the report. 

KWOC Input Processing 

When the requested report is received the request form is removed 

* 

from the binder by the library aide. The aide then handwrites such 
information as can be readily obtained from the request form onto a 5x8 
Bibliographic Record Card (sometimes called Shelf List Card). See 
Figure 5. 

The partially completed Bibliographic Record Card is next given 
to the indexer. The indexer checks the aide's work and completes any 
portions of the bibliographic entry which the aide was unable to 



K 

Handwritten entries on the form are currently specified as a work 
saving measure. Since all of the information on the form later 
appears in the computer printed KWOC output, and since the Biblio- 
graphic Record Card is used primarily as a filed record the hand- 
writing of this record constitutes no significant leg i x lity problem. 
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complete. The indexer, at this point, may determine that the contents 
of the document are not adequately described by the terms in the title. 
In such cases the indexer assigns additional terms and records them 
on the Bibliographic Record Card in the appropriate place. Descrip- 
tive terms assigned by the Indexer follow the standards suggested in 
Standard for Descriptive Cataloging of Government Scientific and 
Technical Reports Revision 1 , AD #641092, October 1966. Index terms 
assigned by the indexer are taken from The Thesaurus of Engineering 
and Scientific Terms , First Edition, Engineer's Joint Council, New 
York, December 1967, or later editions as available. The indexer may 
also consult other guides such as the latest edition of the NASA 
Thesaurus . 

Upon completion of this stage of processing the technical report 
is charged out to the requester. Two circulation cards (Figure 6) 
are filed, one by library accession number and one by borrower name. 

The request sheet is filed in a binder of completed requests, by 
requester name. The Bibliographic Record Card is returned to the 
aide who uses it as a source document for transcription of the 
information onto a keypunch coding sheet. The rules for coding 
for keypunching are fully described in Appendix II, Reference Data 
for Coding KWOC Input. See Figure 7 for sample of the completed 
keypunch coding sheet. After coding, the Bibliographic Record Card 
is filed by accession number as a permanent Shelf List record. The 
coding sheets are sent, in daily batches, to the MITRE Washington 
Computer Center (MWCC) for keypunching. 
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FIGURE 7 

CODING AND KEYPUNCHING KWOC DATA 



How KWOC Reports Are Produced 



In producing the arious KWOC outputs two computer procedures 
are used. One of these is usually run once a month and is called 
"Monthly Run." This run consists of two jobs run together. They are 
called CUMULIST, Job 1 and PRINT * Job 2. See Figure 8 for a flow 
chart. The first job accepts as input the cards for all of the new 
accessions during the previous month. It prepares index records for 
these cards and sorts them. It then produces an output listing of 
these records in standard KWOC format. This output is printed on 
white paper for reproduction as a Working Paper and for distribution 

to the Technical Staff as a current awareness service. 

In the second part of the monthly run the sorted index records 
from the first part of the run are merged with the index records 
in the master file on magnetic tape. This job then produces a listing 
of the total KWOC index including all records ever entered. This is 
printed on three part paper (or three times at the Library's option) 
on striped paper. These outputs are used as the current working copies 
in the Library processing area, in the Library reading room and in the 
Bedford Library. 

The other major run is performed once a week and is called "Weekly 
Run." See Figure 9 for a flow chart. This run accepts as input the 
cards representing all accessions since the last monthly run. In the 
first week after the monthly run it operates with one week's accessions 

in the second week, two weeks’ accessions; and so on. This run also 




18 





>9 



FIGURE 8 
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consists of two job steps. The first, KW0C1, indexes all of the in- 
put records, sorts them and creates an output data set for printing. 
The second step, KW0C2 , prints the records in standard KWOC format, 
The output is printed on striped paper. It is used by the library 
aide and the indexer to check the input to KWOC for errors. It also 
serves as the current cumulative accession list supplement until the 
next monthly run. 

Another run called CHECKOUT is used to aid in checking any set 
of input cards which has never been run before. It is very similar 
to the weekly run. See flow chart in Appendix III. It is normally 
used to check the fourth week*s input to the monthly run prior to its 

entry as input to the monthly run. 

The detailed procedures for obtaining the Monthly and Weekly 
Runs are described in Appendix III. 



ERIC 



21 



SECTION III 
HOW TO USE KWOC 

The main purpose of KWOC Is to permit staff members to locate and 
obtain technical reports in the library easily and efficiently. The 
KWOC index can also provide certain valuable information directly with- 
out obtaining any reports. In order to locate a report, the KWOC user 
must: (1) search the KWOC listing for the appropriate entries; (2) 

extract the report accession numbers; and then (3) obtain the desired 
report from the library staff. If the user is Just Interested in ob- 
taining information about reports he needs only to search the index 
and obtain the information directly from the listing. 

Locating a Report - Single Term Search 

When trying to locate and obtain (retrieve) a report from the 
library the staff member may use either a single term search strategy 
or a multiple term search strategy. In a single term search the KWOC 
user must have in mind one of the following: 

• any significant word in the title 

• the name of any of the authors 

• the name of the organisation that produced the report 

• an index term under which the report is likely to have been 
indexed (by the Library indexer) 

• the originating organization's report number 
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• the task number under which the report was produced (MITRE 
documents) 

• The Department number of the originating department (MITRE 
documents) 

• the DDC identification number (AD Number) 

With any one of these the user can scan the listing at the 
appropriate point and locate all the reports indexed under that one 
keyword or term. 

For example, to find a specific item, DDC Report AD681472, a 
searcher looks in that section of KWOC that lists AD number (Figure 
10) • Here he will find that AD681472 has been accessioned under 
MW3210. To order a copy of the document, the searcher merely asks 
the Library staff for MW3210. If he wishes to know the full title 
of the document, the searcher may then turn to the bibliographic 
citation section. 

Locating a Report - Multiple Term Search 

When performing a single term search on the AD number or the 
originating organisation's report number the user will find (if he 
finds anything under that number) only one report for each index entry. 
When searching on other terms the user may find more than one report 
for each term. For example, the KWOC index will list many reports 
under the keywords "COMPUTER" and "SYSTEM." If the user is looking 
for just one report or Just a few reports from among those he can 
ease the search by thinking of one or more other terms under which 
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the report might be indexed and using them in a multiple term search. 



In most cases the user will find the reports he is looking for just 
by looking up the other terms and scanning the index entries associated 
with those terms. A user looking for reports on design of on-line 
systems would do best to search first under the least common of the 
three terms , on-line , and scan the entries for the more common 



words design and system . The user who does this will find entries 



such as: 



ON-LtNE HW4307 
ON-LINE >*4355 
ON-LINE MW7502 
ON-LINE MWB059 
ON-LINE TKS025 
ON-LINE VP9005 
ON-LINE VP9221 
ON-LINE VP9229 



AN ON-LINE DEBUGGER FOR OS/360 ASSEMBLY LANGUAGE PROGRAMS -JOSEPH. V.H. 
DESIGN CONSIDERATIONS FOR AN ON-LINE MANAGEMENT SYSTEM 
DOCUMENT VECTOR MODIFICATION IN ON-LINE IN FORMAT I ON- RETRIEVAL* SYSTEMS 
EVALUATION OF USER REACTIONS TO A PROTOTYPE ON-LINE 

PROPOSED DESIGN FOR AN ON-LINE INFORMATION RETRIEVAL SYSTEM -KINNEY. O.R. 
PRELIMINARY DESIGN NOTES FOR TRANSPORTATION ON-LINE PLANNING TOP) SYSTEM 
TECHNICAL DESIGN APPROACH TO SASM ON-LINE PLANNING SYPPORT -MAYER. R.P. 

22 SO SERVICES PROVIDED BY STACE-2 OF THE MITE ON-l 1NE SYPPORT SYSTEM 



It does not always occur, however, that the several terms the 
user wants to associate in a multiple term search will appear in the 



same index entry. Sometimes they do, in such entries as MW4355, 
TM5025, and WP9005* Some titles are very long, however, and carry 
over to another input card. Entries MW8095, WP9005, and WP9229 carry 
over# Since each card Is treated independently, related keywords may 
appear in separate entries. The key to coordinating such keywords 
lies in making effective use of the report accession number to 



tie them together. 




26 



Suppose a user wants to obtain all reports held on public 



Investment policy . Translated into search terms this means that he 
wants all reports indexed under the keyword public which are also 
indexed under Investment and are also indexed under policy . 

The basic technique used to perform such a search is to enter 
the index with the least likely keyword and list all of the accession 
numbers for entries under that terra. Then, the user goes to the 
next least likely term and looks for entries with accession numbers 
the same as on the initial list, crossing out all that are not 
found among the entries. Then, on the next term for the same 
operation, and on and on until the search is completed. The re- 
maining list of accession numbers represents the set of reports which 
satisfies the search. In the example shown in Figure 11 the user 
has gone first to the keyword investment and has listed all of the 
accession numbers for reports indexed under that keyword, Then, 
after noting these, he has gone to the index entries for the 
keyword public and has crossed off all accession numbers from the 

Investmen t list not found under the entries for public , The two 
remaining entries, MW3893 and MW7893, could then be checked against 

the entries for the keyword policy , but, since there are only 
two it is easier Just to go to the complete citation section of the 
listing and read the complete entries for both documents, 

This basic, list preparation approach can be used for any 
multiple term search. Other logical combinations of terms can be 
used in searching. The search illustrated in the figures required 
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that all terms be present. It used what Is called AND logic (this 
term AND that term AND some other term must all be present). OR 
searches (this term OR that term, etc.) may also be constructed by 
the user. For these, the accession number list obtained under any 
terra is added to the list under the other terms. Multiple term 
searches may be performed using any number of terms, combining them 
in any manner desired. Careful use of this method and careful 
"jotting" of the list can save a lot of time in looking for reports 
in common subject areas. 

The Library indexer has developed a technique for speeding up 
multiple term searches by providing some pre-coordination of index 
terms. This is done by Jinking index terms with hyphens. The title, 
"THE UNIFORM DATA ELEMENT DESCRIPTION," for example, is indexed by 
standard KWOC techniques under UNIFORM, DATA, ELEMENT, and DESCRIPTION. 
The indexer, in the improved procedure, enriches the input by entering 
the additional term DATA- ELEMENT*. This type of linking is also used 
to hold together the sets of character strings which make up the full 
name of the corporate source! (for example: *S0C1ETY-0F- AUTOMOTIVE- 

ENGINEERS) . Multiple term searches of the index can be made more quickly 
when the searcher can enter the index looking for the complete term, 

DATA- ELEMENT*, in one place rather than having to coordinate all of 
the entries for DATA with all of the entries for ELEMENT. 
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Retrieving Information 



The KWOC Index can also be used to retrieve information. For 
example, staff members can obtain a list of all the documents they 
have published by looking in the author section of the KWOC listing. 
See Figure 12. A KWOC user interested in seeing vhat type of reports 
we have been getting from RA\D recently can look under the RAND entry 
in the Corporate Source section of the listing. The user, usually 
with the help of the Library staff, can also prepare bibliographies 
on any single term or any combination of terms. All of the informa- 
tion required can be obtained from the index without retrieving the 
actual reports. 

The possibilities for information retrieval from KWOC are 
limited only by the imagination of the user and the time available 
to search for and compile the information. 
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NtAtfr.S 4* FUNCTIONS 0*" MIPO^p *ANAG C R$ -NFUFY.f - p I FL PF* .f . 

NCBVIITSY^.V.O. *W3**3A .VJTOMATfo SYSTEMS -NEBYLI Tf YN. V.O. MAN/MACMlNF-lNfFPAC T I ON* 

NEtLY.k.b. *WV#*9 CONTEXTUAL ANALYSIS HP PHONEYS OF ENGLISH -RFOOYhP.R. -NFFlY.R-3. 
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SECTION IV 



A PROGRAMMER'S OVERVIEW OF KWOC 

Brief program descriptions and some comments on special features 
are given in this section. Full supporting documentation (flow 
charts, program and procedure listings) is given in Appendix IV, 
Programmer's Handbook for KWOC. 

Basic Programs 

The system is based on use of three programs run on the IBM 
360/50 in a multi -programming mode under OS 360, Version 18 with 
HASP (Houston Automatic Spooling Processor, an I/O expediting 
feature). Detailed flow charts of all programs are contained in 
Appendix IV. Listings are presented in Appendix V, 

The first program, KW0C1, written in assembly language: reads 
input cards; identifies character strings of two characters or more; 
compares these strings against stop list terms: and if the string is 
not on the list (and is, therefore, a KWOC keyword), creates an 
index for the string. These records are in the same format as the 
print line. Each record, In fact, is one full line of the output. 

The KWOC indexing propram requires approximately 300 instructions. 

It presently conta 'i00-word stop list suggested by Williams 

in his book.* Th rogram, when link-edited, requires 46k 

bytes . 



*W. F. Williams, 
The Business Pre 



s of Automated Information Retriev aj. , 
cst , 111., 1965, pp. 126-129. 



The output of KW0C1 is sorted by the Sort-Merge utilities called 
by Job Control Language (JCL) statements. Records are sorted on the 
first 40 characters. This includes the keyword field and the first 
17 characters of the context field. 

The basic print procedure uses the Tape-To-Print utility 
called by JCL, A slightly more sophisticated assembly language print 
program has been written. This program, KW0C2, reads the sorted 
output of KW0C1; deletes records containing certain tags, if suppression 
of those records is desired; deletes tags from the keyword fields; 
and prints using normal pagination controls. This program, when 
link-edited, requires 48k bytes. 

These three programs are used to prepare and check the newly 
indexed material weekly. After checking, newly indexed records 
are accumulated and, once a month, inserted into the master file 
with the Sort-Merge utility under JCL, The master file is printed 
periodically with KWOC2, 

Special Features 

1, Character string separators - any number of blanks, up to 
four, or a comma are interpreted as a break in the input 
character string. Each string of characters between such 
separators is considered a potential keyword to be checked 
against the stop list. More than four blanks separating 
character strings are interpreted as the end of the text on 
the card. 
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2. Stop List - this presently consists of a table of 400 common 
words not useful for indexing. It is possible to extend this 
list (or shorten it) by altering the table in the program. 

No term included in the list will be used as an index term by 
the KWOC program. In addition to terms on the Stop List all 
single character strings are omitted from indexing. 

3. KW0C1 Output record format - this currently appears as 
follows : 

Character Position 



1-20 


21-23 


24 104 


105-125 


Keyword 


Space 


Context 


Space 



Field 

The output of the print program is as follows: 
Character Position 



1-20 


21-23 


24 104 


105-111 


Keyword 


Space 


Context 


Space 



Field 

There is nc requirement for the extra space at the end of the 
records and no significance to the different lengths. 

4. Program-assigned prefix for accession numbers - So that the 
library accession numbers will all fall together and come first 
in the sort sequence, the program prefixes them with a special 
"character". This "character", hexadecimal 00, is lower in the 



EBCDIC collating sequence then blank (hexadecimal 40) or the first 
printable character, the 0 sign, hexadecimal 4A. 

5. Prefix Change for Corporate Source entries - An initial colon 
is replaced with an equal sign under program control so the 
report numbers and Clearinghouse/DDC numbers will be sorted to- 
gether. 

6. Sort Procedure (Job Control Language Statements) - The out- 
put records from KW0C1 are sorted by the sort-merge utilities 

in a single alphabetic disk sort on the first 40 characters. Six 
work areas are used. The output from the sort step consists of 
a set of ordered records that can be output on magnetic tape for 
retention or stored temporarily on disk and used as input to the 
print programs. Because of the special alpha "tags 11 for the va- 
rious data elements, each set of related records is sorted to- 
gether even though only one sort is used. 

7. Basic Print Procedure (Job Control Language Statements) - 
The basic print procedure uses the "tape to print" utility. It 
reads records from the sorted output and prints them, single spaced, 
on the line printer. It is useful for checking over the coded 
input for the updating cycle. It performs no character manipula- 
tion. It skips to a new page only on a print overflow condition. 

8. KW0C2 , a More Sophisticated Print Program - This program 
contains several special printing features which are used to 
improve the readability of the output. These features are; 
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a. Page change - page changes occur on print overflow and 
on changes in initial character, when the initial character 
is one of the special character prefixes. No carriage 
control characters are used in the process. 

b. Prefix deletion - blank is substituted for all initial 
special characters in the output records. This is done 
only for special characters below A in the collating 
sequence. The substitution is performed just prior to 
printing* 

c. Section deletion - complete segments of the output may 
be omitted from the printing by user request, The KW0C2 
input parameter card provides a space to list prefix charac- 
ters, No output record containing the prefix characters 
specified in that field on the parameter card will be 
printed. For example, putting a hyphen and a slash (the 
characters used as prefix for author and date) in the 
parameter card field will cause the author and the date 
listing segments to be omitted. 

d. Dating of Output - the date shown on the KW0C2 input 
parameter card is reproduced at the beginning of each 
listing segment. It is emitted along with the first line 
after each page change caused by an initial special character 
change. 
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9. Holding of old input decks - at present all input cards are re- 
tained in an archive file. The lack of a selective retrieval program 
makes this necessary. In order to produce a listing of any subset 

of the master file (i.e., The Division 3 documents) it would be 
necessary to use the input cards and create a new, special master 
file containing only Division 3 document index records. Printing this 
would provide the desired output. 

10. Retiring of Old KWOCs - from time to time the size of the output 
listing will cause excessively long running times. At this point 

it will be necessary to retire certain records from the master file. 

At present it is planned to retire old KWOC records a year at a time. 
The number of years’ worth while being retained in the active file 
is not yet set but will be at least one and perhaps as high as three. 
For older records one will have to search year by year through earlier 
KWOCs . 

System Limitations 

1. No provision has been made in the system for master file 
maintenance. Records entered into the master file cannot, with 
present KWOC programs, be changec*. This requires a high level 

of quality control on input. It is the primary reason for having 
the Library staff scan the cumulative weekly listings carefully 
to spot the need for corrections in input cards. 

2. The system provides no capability for automatic retrieval 
from the master file. All retrieval is performed manually 
using the listings of the master file and the cumulative updates. 
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3. The number of cards which may be input at any one time is 
limited to approximately 2,000. The sort programs are currently 
set to use 50 contiguous disk tracks for each of six sort work 
areas. Assuming the average number of index records generated 
from each card is about 6, then 2,000 cards of input will pro- 
duce about 12,000 records of 125 characters each, a total of 

1.500.000 characters. The six areas of 50 tracks will hold 

1.875.000 characters at the blocking factor of 50 records per 

block used by KW0C. The difference between the capability of 
the work areas and the amount specified as the limit for input 
is "insurance." 

If necessary, the '’umber of tracks can be increased to 75 
or more with only a sma 1 JCL change. 
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SECTION V 



KWOC ECOLOGY 

There are several features of the KWOC index as implemented 
in the Washington Library which are vital to its continued effective 
operation. These are: 

(1) It is guardedly responsive, providing the best quality 
report indexing service available to meet the requirements within 
the resource limitations imposed. 

(2) It is very inexpensive to operate. 

O) It contains timely information. 

(4) It is fully documented. 

A number of hidden physio-socio-psychological threats to the current 
health and vigor of the system exist, however. In order to assure 
the continued usefulness of the KWOC index care must be exercised 
by all involved not to let these hidden threats surface to infect 
KWOC with a crippling illness. A few of the major threats to be 
avoided are described briefly below. 

The "left-in-the-sponge" syndrome often infects KWOC makers. 

One can always discover some little change that could be made to 
correct an error that got by last month or to improve the logic 
flow of the program. These changes require modifying the program 
and, once that has been started, one can always think of a few more 
"nice to have" capabilities (see "go-go- it is" below) and so on ad 
infinitum. Programming and debugging are the most expensive parts 
of the KWOC development. Re-programming opens up many uncertainties, 




Ho* 41 



causes extra shake-down runs and destroys the validity of the docu- 
mentation, It takes courage to wait it out, but the minor problems, 
like the surgeonb spowge, either melt away or pass some other way 
into oblivion. 

The "go-go-itis" virus is also contagious in the creative 
environment at MITRE. Programmers' pulse rates rise dramatically 
when they dream of all that delightful library data just sitting 
around in ihe KWOC files and what they could do with it. Hallucina- 
tions of on-line querying from desk top CRTs and instant responses 
pop into mind. No certain cure has been found for this yet; however, 
massive doses of "see what you can do with just the plain old listing" 
while bitter to the taste, have proven effective in many cases. 

The "aching backlog" is perhaps the most insidious illness 
which can befall the KWOC index. It is allegedly the cause of the 
paralysis of the prior system, Its onset is caused by a variety of 
agents including: "higher" priorities; set-aside-it is and; too- 
many-on-today 's-batch-to-do-osis. Failure to process into the 
KWOC index each day the information about the reports that arrive 
each day can cause a growing numbness of the system. Existence of 
any backlog means that the most recently acquired reports are not 
included in the listings. It is often these most recent which are 
the most useful to the staff. It is easy to maintain throughput 
in the system by injecting transfusions of extra help when It is 
required . 
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The last item which must be mentioned is "documentation 



aphasia." With the publication of this document all users of the 
KWOC index have one central place to look for information about 
its use. Despite the warnings, imprecations and bald threats from 
the system developers (who we will soon discover were not as omni- 
scient as we had thought) some changes to the KWOC system will even- 
tually be made. Some of these will be made to adjust to new require- 

ments or new resource situations, others will just be because "we 
prefer to do it that way - and it doesn’t make any difference does 

it?." Whatever the reason for the change, it must be documented . If 

changes are not documented sudden heart failure may result some day 
when the whole system shuts down because the person who was keeping 
track of all the changes stays home ill or is otherwise unavailable. 

A well placed innoculation with a recording needle is the remedy for 
the problem. All who change anything about the system or any of its 
procedure* must have the remedy applied promptly. Thus infused with 
the desire to record their change, they will send their information 
to: 

Miss Paula Strain 

MITRE Washington Library 
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APPENDIX II 



REFERENCE DATA FOR CODING KWOC INPUT 



Accession Number 

- The first notation made on each input card, starting in card 
column 1, will be the report accession number, 

A card sequence letter will be suffixed to the accession num- 
ber on all cards after the first card. These will start with A and, 
on succeeding cards letters B, C, D, etc. will be used as required. 

No letter beyond Q in the alphabet will be used for card sequencing. 
In accessioning MITRE-generated documents or others which have 
volumes, series, revisions, etc. in the accession number, use the fol- 
lowing abbreviations (without punctuation): 

V ... Volume 
S ... Series 
R ... Revision 
SP ... Supplement 
X ... Correction 
An example: WP1234V1S2R1SP3X2 

WP1234V1S2R1SP3X2A 
B 
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- The Technical I ts Section of the Library and Document 

Control Department will sign different numbers to the different 

volumes or supplements report from another agency than MITRE 

(i.e., one accession j o each volume or part). MITRE-generated 

documents will continue to use their own report number as an accessions 
number in the Library. In cases where the number runs beyond 19 
characters or there is other difficulty, the Technical Reports Staff 
will consult Miss Strain so that a consistent policy for the Library 
can be developed. 

- The Technical Reports Section will use the control numbers 
assigned by Document Control Department for classified documents 
rather than reaccessioning them with a new number. The acronym for 
the Issuing agency, which Document Control uses as the first part of 
the control number will, in the Library entry, be filed after the 
control number, rather than before, for a more meaningful sort by the 
computer. 

- Classified reports will show the classif ication symbol In 
parentheses, as part of the accession number character string after 
the card sequence letter. 

Examples : 

W1234/RAC(C) 

WP1279(S) 




53 



Title 



- The report title will be coded immediately after the accession 
number character string, starting on the first card. 

- Code just as in document, with minimal modifications. 

- If a word in the title is also assigned as an index term, the 
word in the title is coded with an asterisk following it. 

Example: 

A STUDY OF URBANIZATION* IN THE UNITED-STATES, PART-II 

URBANIZATION* is an index term as well as a title keyword. 

UNITED-STATES is hyphenated to prevent generation of sep- 
arate listings for both words. 

PART 2 is coded as PART- 2 to prevent separate listings 
of "PART 11 and " 2 

- Terms in the title within quotation marks or inside parentheses 
will be coded with the leftmost quotation mark or parenthesis omitted 
in order to avoid listing under ( in the sort sequence. 

Example: 

GROWTH OF URBANIZATION (URBGRO) IN M DISNEYLAND EAST 1 ' 
would be coded: 

GROWTH OF URBANIZATION URBGRO) IN DISNEYLAND EAST" 

- Normally, terms in titles will not be abbreviated. Where 
abbreviation is desired, only abbreviations from the Library Clerk's 
list of acceptable abbreviations will be used. This list is kept in 
her KWOC notebook. 
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Author 



- Each author's name is coded, with a dash preceding the first 
letter of the last name. Initials only follow the last name. No 
given names are spelled out. They are connected to the last name by 
periods so that the character-string is treated as a single group. 

Junior or III are added to the name entry after the last name and before 
the initials, vnly for authors of MITRE Papers. No titles are coded. 

Example: 

-HENDERSON. D.D. -MAISH.A.M. -MORAN. C.V.-PETTIBONE. JR. R.O. 

If the document lists a single author with ET AL ather 
than a number of names, ETAL is used. 

Example: 

-FLURY.W.R.ETAL 

A group author like SASM SUPPORT GROUP is coded as a 
single group with hyphens as connectors. 

Example: 

- SASM- S UPPORT-GROUP 

Index Terms 

- Index terms are coded with an asterisk directly following the 
last letter. 

COMPUTERS* PROGRAMMING* 

COMPUTER-PROGRAMMING* 

PROGRAMMING-LANGUAGES* 
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Date 



- Coded as last two digits of year, month number, and day, pre- 
ceded by a slash / 

(YYMMDD) 

30 December 1969 is /691230 
(with no blanks between characters) 

MITRE Department 

- Indicate Department by .D or .C followed by hyphen followed by 
Department number in two digits. 

• D-02 
. D-24 
.C-70 

MITRE Project Number 

- Project/Task number preceded by percent sign 

X31.5.0 

- DCA Program Line Item (PL1) numbers are appended to the Task 
number by a slash / 

X31. 5. 0/20601. 123AA 

AD Number , also 

NASA and CFSTI Numbers 

- Preceded by * sign. No embedded blanks or commas 

“AD896543 -N70-14321 -PB1702A6 

Corporate Report Number 

- Preceded by colon: No embedded blanks or commas 

: JWGA-100-54 
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Corporate Source 

- Preceded by * linked by - if required 

*RAND 

*ARMY-AUTOMOTIVE-COMMAND 

- Abbreviations are being used for certain corporate sources 
whose reports make up a large number of entries in the collection. 
The complete list of corporate source conventions is maintained by 
the Library Aide in her KWOC notebook. 
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APPENDIX III 



RUN PROCEDURES 

Monthly Run Procedure (May be run less frequently) 

- On Thursday afternoon take the most recent week's input cards 
just returned from keypunch and submit them to the Computer Center as 
a job with the CHECKOUT procedure. (This is the same as the weekly 
run procedure but is run with only one. week’s input.) See Figure III-l 
for deck setup. 

- Friday morning: get the CHECKOUT output listing. Make the 

corrections to the cards, and then put them in the deck with all the 
other cards from the past three weeks. This has to be done by 5:00 
p.m. Friday because computer time is allotted for the run on Sunday 
night. Do not add any new cards after finding the ones that need cor- 
rection. Any late keypunching must await the next week's run. 

- Submit the data cards with the appropriate control cards for 
the oultistep job CUMULIST, JOB 1 of 2. (See Figure I1I-2.) You have 
to specify on the Job Request Card that output must be suitable for 
reproduction. 

The entries in columns 21-26 will cause suppression of any of 
the portions of the listing related to contract number. Department, Cor- 
porate Source, Date, Project, Corporate Report No. and AD No. Only the 
basic bibliographic entry, the author listing, and the subject index 
will be printed. 
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FIGURE IlH 

CHECKOUT PROCEDURE INPUT 
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FIGURE Ul-2 
CUMULIST PROCEDURE 
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FIGURE HI-2 (COMT'O) 
CUMULIST PROCEDURE 



The "suppress" control card for the print program in this run 
is keypunched as follows: 

Columns 1-20, date desired on printout; e.g., 31 March 1971 
Column 21 + 

Column 22 . 

Column 23 * 

Column 24 / 

Column 25 % 

Column 26 : 

Column 27 ■ 

Be sure to specify on the Job Request that the printing is to 
be reproduced. If more than one copy is desired, specify "Repeat Print." 
This output is the monthly Accessions List. 

- For JOB 2, PRINT, which is run immediately behind JOB 1, the 
control card contains the usual information and the extra notations "To 
be printed on striped paper" and "Repeat Print 3 copies." The output of 
this run is the KWOC listing which is put out for users in the Library. 
See Figure III-3 for input deck setup. The control card for JOB 2 is 
keypunched as follows: 

Columns 1-20 contain the date desired on printout 
Column 21 + 

All other columns are left blank. The + in column 21 will cause 
the contract number portion of the listing to be suppressed. (Column 21 
is the only column currently used.) 




62 



Weekly Run Procedure 



• Monday through Thursday: code input data for all the accessions 

in one week; get the data keypunched, a few sheets at a time, so it is 
done by Thursday afternoon* Also, correct previous week’s data cards 
on Monday. 
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Monday: have the cards sorted alphabetically on columns 7-1 

so that the corrections can be made\ easily. 

Friday afternoon: submitthe computer job, with request card 

and control cards for the procedure called WEEKLY; computer time is 
reserved for Sunday night/Monday morning (See Figure III-4 for the 
job deck setup). The "suppress" control card is keypunched as follows: 

Columns 1-20, Date (enter in numeric form or with months 
spelled ouc)< * 

Column 21, ® sign to suppress printing of contract number listing. 

Columns 22-28, other characters that designate parts of the 
listing you want to suppress (Examples: colon to suppress corporate 

report number, period to suppress department number), 

Monday morning: pick up the weekly cumulative acquisition print- 

out listings. Check them for coding errors. Make a list of corrections. 

Put the new listings out for library use by noon Monday. 

Make keypunch corrections to cards and examine them to make sure the 
punches are not "off gauge." Duplicating on the keypunch sometimes 
introduces punching position of alignment errors into cards that were 
previously all right. Insert corrections into the deck and hold them 
for the next weekly run. The corrected cards will appear in the fol- 
lowing week’s run. 

Second and third week, add new punched cards to the back of 
deck after they have been sorted together, alphabetically, on columns 
7 through 1. The sorting is for convenience in finding the cards that 
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FIGURE 111-4 

WEEKLY PROCEDURE INPUT 



need correction. There is no program requirement for the cards to be 
in order. DO NOT leave Job Control cards in the data card deck when 
the deck is sorted. 
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FIGURE IV- ! 

PROCEDURE FLOW CHART FOR CUMULIST AND PRINT JOB 
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FIGURE IV -2 

PROGRAM FLOW CHART FOR KWOC I 
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FIGURE IV-2(C0NT'0) 
PROGRAM FLOW CHART FOR KWOC I 
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FIGURE IV -3 

PROGRAM FLOW CHART FOR KWOC 2 PROGRAM 
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FIGURE V-l (CONTINUED) 
tcwoc-l PROGRAM LISTING 
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FICURE V-l (CONTINUED) 
KWOC-I PROGRAM LISTING 
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FIGURE VH (CONTINUED) 
KMOC-I PROGRAM LISTING 
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FIGURE VH (CONTINUED) 
KWOC-I PROGRAM LISTING 
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FIGURE VH (CONCLUDED) 
KWOC-I PROGRAM LISTING 
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FIGURE V-2 (CONTINUED) 
KWOC-2 PROGRAM LISTING 
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FIGURE V-2 (CONTINUED! 
KWOC-2 PROGRAM LISTING 
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FIGURE V-2 (CONCLUDED) 
KWOC-2 PROGRAM LISTING 
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